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Description 

This Invention relates to recombinant DNA 
technology. It particularly relates to the applica- 
tion of the technology to the production of human 
serum albumin (HSA) in microorganisms for use 
in the therapeutic treatment of humans, in one 
aspect the invention relates to a technique for 
producing DNA sequences encoding desired 
polypeptides. In another aspect it relates to the 
construction of microbial expression vehicles 
containing DNA sequences encoding a protein, 
e.g. human serum albumin or the biologically 
active component thereof operably linked to 
expression effecting promoter systems and to the 
expression vehicles so constructed. In another 
aspect, the present invention relates to microor- 
ganisms transformed with such expression 
vehicles, thus directed in the expression of the 
DNA sequences referred to above. In yet other 
aspects, this invention relates to the means and 
methods of converting the end product of such 
expression to entities, such as pharmaceutical 
compositions, useful for the therapeutic treat- 
ment of humans. In preferred embodiments, this 
invention provides for particular expression vec- 
tors that are sequenced properly such that mature 
human serum albumin is produced directly. 

In one aspect, the present invention is particu- 
larly directed to a method of preparing cDNA 
encoding polypeptides or biologically active por- 
tions thereof. This aspect provides the means and 
methods of utilizing synthetic primer DNA corre- 
sponding to a portion of the mRNA of the 
intended polypeptide, adjacent to a known 
endonuclease restriction site, in order to obtain 
by reverse transcription a series of DNA frag- 
ments encoding sequences of the polypeptide. 
These fragments are prepared such that the entire 
desired protein coding sequence is represented, 
the individual fragments containing overlapping 
DNA sequences harboring common endonuc- 
lease restriction sites within the corresponding 
overlapping sequence. This aspect facilitates the 
selective cleavage and ligation of the respective 
fragments so as to assemble the entire cDNA 
sequence encoding the polypeptide in proper 
reading frame. This discovery permits the obten- 
tion of cDNA of high molecular weight proteins 
which otherwise may not be available through 
use of usual reverse transcriptase methods and/or 
chemical synthesis. 

The publications and other materials hereof 
used to illuminate the background of the inven- 
tion, and in particular cases, to provide additional 
details respecting its practice are incorporated 
herein by reference, and for convenience, are 
numerically referenced in the following text and 
grouped in the appended bibliography. 

(A) Human serum albumin 

Human erum albumin (HSA) is the major pro- 
tein species in adult serum. It is produced in the 
liver and is largely responsible for maintaining 
normal osmolarity in the bloodstream and func- 



tions as a carrier for numerous serum molecules 
(1, 2). The apparent fetal counterpart of HSA is a- 
feto protein and studies have been undertaken to 
compare the two as well as rat serum albumin 

s and a-fetoprotein (3 — 8). The complete protein 
sequence of HSA has been published (9 — 12). The 
published protein sequences of HSA disagree in 
about 20 residues as well as in the total number of 
amino acids in the mature protein [584 amino 

w acids (9); 585 (12)]. Some evidence suggests that 
HSA is initially synthesized as a precursor 
molecule (13, 14) containing a "prepro" 
sequence. The precursor forms of bovine (15) and 
rat (16) serum albumin have also been 

75 sequenced. 

The role or rationale for the use of albumin in 
therapeutic application is for the treatment of 
hypovolemia, hypoproteinemia and shock. Albu- 
min currently is used to improve the plasma 

20 oncotic (colloid osmotic) pressure, caused by 
solutes (colloids) which are not able to pass 
through capillary pores, inasmuch as albumin has 
a low permeability constant, it essentially confi- 
nes itself to the intravascular compartment. When 

25 different concentrations of nondiffusable particles 
exist on opposite sides of the cell membrane, 
water crosses the partition until the concen- 
trations of particles are equal on both sides. In 
this process of osmosis, albumin plays a vital role 

30 in maintaining the liquid content in blood. 

Thus, the therapeutic benefits of albumin 
administration reside primarily for the treatment 
of conditions where there is a loss of liquid from 
the intravascular compartment, such as in surgi- 

35 cai operations, shock, burns, and hypop- 
roteinemia resulting in edema. Albumin is also 
used for diagnostic applications in which its 
nonspecific ability to bind to other proteins makes 
it useful in various diagnostic solutions. 

40 Presently, human serum albumin is produced 

from whole blood fractionation techniques, and 
thus is not available in large amounts at competi- 
tive costs. The application of recombinant DNA 
technology makes possible the production of 

45 copious amounts of human serum albumin by 
use of genetically directing microorganisms to 
produce it efficiently. The present invention may 
enable the availability of purified HSA produced 
through recombinant DNA technology more. 

so abundantly and at lower cost than is now 
presently possible. The present invention also 
provides knowledge of the DNA sequence 
organization of human serum albumin and its 
deduced amino acid sequence, helping to eluci- 

55 date the evolutionary, regulatory, and functional 
properties of human serum albumin as well as its 
related proteins such as a I ph a-fetoprotein. 

More particularly, present invention provides 
for the isolation of cDNA clones spanning the 

60 entire sequence of the protein coding and 3' 
untranslated portions of HSA mRNA. These cDNA 
clones were used to construct a recombinant 
expression vehicle which directed the expression 
in a microorganism strain of the mature HSA 

65 protein under control of the trp promoter. The 
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present invention also provides the complete 
nucleotide and deduced amino acid sequence of 
HSA. 

Reference herein to the expression of "mature 
human serum albumin" connotes the microbial 
production of human serum albumin unaccom- 
panied by the presequence ("prepro") that imme- 
diately attends translation of the human serum 
albumin mRNA. Mature human serum albumin, 
according to the present invention, is imme- 
diately expressed from a translation start signal 
(ATG), which also encodes the amino acid 
methionine (inked to the first amino acid of 
albumin. This methionine amino acid can be 
naturally cleaved by the microorganism so as to 
prepare the human serum albumin directly. 
Mature human serum albumin can be expressed 
together with a conjugated protein other than the 
conventional leader, the conjugate being speci- 
fically cleavable in an intra- or extracellular 
environment. See British patent publication 
number 2007676A. Finally, the mature human 
serum albumin can be produced in conjunction 
with a microbial signal polypeptide which trans- 
ports the conjugate to the cell wall, where the 
signal is processed away and the mature human 
serum albumin secreted. 

(B) Recombinant DNA technology 

With the advent of recombinant DNA tech- 
nology, the controlled microbial production of an 
enormous variety of useful polypeptides has 
become possible. Many mammalian polypep- 
tides, such as human growth hormone and 
human and hybrid leukocyte interferons, have 
already been produced by various microorgan- 
isms. The power of the technology admits the 
microbial production of an enormous variety of 
useful polypeptides, putting within reach the 
microbially directed manufacture of hormones, 
enzymes, antibodies, and vaccines useful for a 
variety of drug-targeting applications. 

A basic element of recombinant DNA tech- 
nology is the plasmid, an extrachromosomaf loop 
of double-stranded DNA found in bacteria often- 
times in multiple copies per cell. Included in the 
information encoded in the plasmid DNA is that 
required to reproduce the plasmid in daughter 
cells (i.e., a "replicon" or origin of replication) and 
ordinarily, one or more phenotypic selection 
characteristics, such as resistance to antibiotics, 
which permit clones of the host cell containing 
the plasmid of interest to be recognized and 
preferentially grown in selective media. The utility 
of bacterial plasmids lies in the fact that they can 
be specifically cleaved by one or another restric- 
tion endonuclease or "restriction enzyme", each 
of which recognizes a different site on the plasmid 
DNA. Thereafter heterologous genes or gene 
fragments may be inserted into the plasmid by 
endwise joining at the cleavage site or at recon- 
structed ends adjacent to the cleavage site. (As 
used herein, the term "heterologous" refers to a 
gene not ordinarily found in, or a polypeptide 
sequence ordinarily not produced by, a given 



microorganism, whereas the term "homologous" 
refers to a gene or polypeptide which is found in, 
or produced by the corresponding wild-type 
microorganism.) Thus formed are so-called 

5 repllcable expression vehicles. 

DNA recombination is performed outside the 
microorganism, and the resulting "recombinant" 
replicable expression vehicle, or plasmid, can be 
introduced into microorganisms by a process 

10 known as transformation and large quantities of 
the heterologous gene-containing recombinant 
vehicle obtained by growing the transformant. 
Moreover, where the gene is properly inserted 
with reference to portions of the plasmid which 

is govern the transcription and translation of the 
encoded DNA message, the resulting expression 
vehicle can be used to actually produce the 
polypeptide sequence for which the inserted gene 
codes, a process referred to as expression. 

20 Expression is initiated in a DNA region known 
as the promoter. In the transcription phase of 
expression, the DNA unwinds, exposing the 
sense coding strand of the DNA as a template for 
initiated synthesis of messenger RNA from the 5' 

25 to 3' end of the entire DNA sequence. The 
messenger RNA is, in turn, bound by ribosomes, 
where the messenger RNA is translated into a 
polypeptide chain having the amino acid 
sequence for which the DNA codes. Each amino 

30 acid is encoded by a nucleotide triplet or "codon" 
which collectively make up the "structural gene", 
i.e., that part of the DNA sequence which encodes 
the amino acid sequence of the expressed poly- 
peptide product. 

35 Translation is initiated at a "start" signal (ordi- 
narily ATG, which in the resulting messenger 
RNA becomes AUG). So-called stop codons, 
transcribed at the end of the structural gene, 
signal the end of translation and, hence, the 

40 production of further amino acid units. The result- 
ing product may be obtained by lysing the host 
cell and recovering the product by appropriate 
purification from other proteins. 
In practice, the use of recombinant DNA tech- 

45 noiogy can express entirely heterologous poly- 
peptides — so-called direct expression — or 
alternatively may express a heterologous poly- 
peptide, fused to a portion of the amino acid 
sequence of a homologous polypeptide. In the 

so latter cases, the Intended bioactive product is 
rendered bioinactive within the fused, homolog- 
ous/heterologous polypeptide until it is cleaved in 
an extracellular environment. See Wetzel, 
American Scientist 68, 664 (1980). 

55 If recombinant DNA technology is to fully sus- 

tain its promise, systems must be devised which 
optimize expression of gene inserts, so that the 
intended polypeptide products can be made avail- 
able in controlled environments and in high 

60 yields. 

(C) State of the art 

Sargent era/., in Proc. Nat/, Acad. Sci, (USA) 78, 
243 (1981), describe the cloning of rat serum 
65 albumin messenger RNA in a series of recombin- 
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ant DNA plasmids. This was done to determine 
the nucleotide sequences of the clones in order to 
study the evolutionary hypothesis of the protein 
product. Thus, these workers made no attempt to 
assemble the cDNA fragments they prepared. 

In Journal of Supramolecular Structure and 
Cellular Biochemistry, Supplement 5, 1981, Alan 
R. Liss f Inc. NY, Dugaiczyk eta/, report, in abstract 
form, their studies of the human gene for human 
serum albumin. They obtained cDNA fragments 
but there is no evidence that these workers cloned 
or produced by fragments for any purpose other 
than for studying the basic molecular biology of 
the a-fetoprotein and serum albumin genes. 

The present invention is based upon the discov- 
ery that recombinant DNA technology can be 
used to successfully and efficiently produce 
human serum albumin in direct form. The product 
is suitable for use in therapeutic treatment of 
human beings in need of supplementation of 
albumin. The product is produced by genetically 
directed microorganisms and thus the potential 
exists to prepare and isolate HSA in a more 
efficient manner that is presently possible by 
blood fractionation techniques. It is noteworthy 
that we have succeeded in genetically directing a 
microorganism to produce a protein of enormous 
length — 584 amino acids corresponding to an 
mRNA transcript upwards of about 2,000 bases. 

The present invention comprises the human 
serum albumin thus produced and the means and 
methods of its production. The present invention 
is further directed to replicable DNA expression 
vehicles harboring gene sequences encoding 
HSA in directly expressible form. Further, the 
present invention is directed to microorganism 
strains transformed with the expression vehicles 
described above and to microbial cultures of such 
transformed strains, capable of producing HSA. in 
still further aspects, the present invention is 
directed to various processes useful for preparing 
said HSA gene sequences, DNA expression 
vehicles, microorganism strains and cultures and 
to specific embodiments thereof. Still further, this 
invention is directed to the preparation of cDNA 
sequences encoding polypeptides which are 
heterologous to the microorganism host, such as 
human serum albumin, utilizing synthetic DNA 
primer sequences corresponding in sequence to 
regions adjacent to known restriction endonuc- 
iease sites, such that individual fragments of 
cDNA can be prepared which overlap in the 
regions encoding the common restriction 
endonuclease sites. This embodiment enables the 
precise cleavage and ligation of the fragments so 
as to prepare the properly encoded DNA 
sequence for the intended polypeptide. 

The work described herein involved the 
expression of human serum albumin (HSA) as a 
representative polypeptide which is heterologous 
to the microorganism employed as host. Likewise 
the work described involved use of the microor- 
ganism E coli K-12 strain 294 (end A, thi", hsr", 
k hsm + ), as described in British Patent Publication 
No. 2055382A. This strain has been deposited 



with the American Type Culture Collection, ATCC 
Accession No. 31446. 

The invention, in its most preferred embodi- 
ments, is described with reference to E coli, 

s including not only strain E coli K-12 strain 294, 
defined above, but also other known E coli 
strains such asE cofiB, E coiix 1776 and E co/i\N 
31 10, or other microbial strains many of which are 
deposited and (potentially) available from recog- 

w nized microorganism depository institutions, 
such as the American Type Culture Collection 
(ATCC) — cf. the ATCC catalogue listing. Seel also 
German Offenlegungsschrift 2644432. These 
other microorganisms include, for example, 

15 Bacilli such as Bacillus subtiiis and other 
enterobacteriaceae among which can be men- 
tioned as examples Salmonella typhimurium and 
Serratia marcesans, utilizing plasmids that can 
replicate and express heterologous gene 

20 sequences therein. Yeast, such as Saccharomyces 
cerevisiae, may also be employed to advantage 
as host organism in the preparation of the inser- 
tion proteins hereof by expression of genes cod- 
ing therefor under the control of a yeast pro- 

25 moter. (See the copending U.S. patent application 
of Hitzeman et al. t filed February 25, 1981 (Attor- 
ney Docket No. 100/43), assignee Genentech, Inc. 
era/., or the corresponding European Application 
82300949.3 which are incorporated herein by 

30 reference. 

Preferred embodiments of the invention will 
now be described with reference to the accom- 
panying drawings in which: 

Figs. 1A and B are diagrams for use in 

35 explaining the construction of plasmid pHSAI; 

Fig. 2 shows the immunoprecipitation of bacte- 
rialiy synthesised HSA; and 

Fig. 3 shows the amino acid sequence of HSA 
and the corresponding DNA sequence. 

40 in Fig. 1A, the top line represents the mRNA 
coding for the human serum albumin protein and 
below it the regions contained in the cDNA clones 
F-47, F-61 and B-44 described further herein. The 
initial and final amino acid codons of the mature 

45 HSA mRNA are indicated by circled 1 and 585 
respectively. Restriction enconuclease sites 
involved in the construction of pHSAI are shown 
by vertical lines. An approximate size scale is 
nucleotides is included. 

50 The completed plasmid pHSAI is shown in Fig. 
1B, with HSA coding regions derived from cDNA 
clones shaded as in A). Selected restriction sites 
and terminal codons number 1 and 585 are 
indicated as above. The E coli trp promoter- 

55 operator region is shown with an arrow 
representing the direction of transcription. G:C 
denotes an oligo dG:dC tail. The leftmost Xbal 
site and the initiation codon ATG were added 
synthetically. The tetracycline (Tc) and ampiciliin 

60 (Ap) resistance genes in the pBR322 portion of 
pHSAI are indicated by a heavy line. 

Figure 2 depicts the immunoprecipitation of 
bacteria lly synthesized HSA. 
E coli cells transformed with albumin express- 

65 ion plasmid pHSAI (lanes 4 and 5) or control 
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plasmid pLelFA25 (containing an interferon a gene 
in the identical expression vehicle; lanes 2, 3 and 7} 
were grown in 35 S-methionine-supplemented 
media. Samples in lanes 2, 4 and 7 were induced 
for expression from the trp promoter in M9 media 
Jacking tryptophan; samples in lanes 3 and 5 were 
grown in tryptophan-containing LB broth to rep- 
ress the trp promoter. Each sample lane of the 
autoradiograph of the SDS-polyacryiamide gel 
presented here contains labeled protein 
immunoprecipitated from 0.75 ml of cells at a 
density of A 550 =1. Lanes 1 and 6 contain radio- 
active protein standards (BRL) whose molecular 
weight in kilodaltons is indicated at the left. 
Bacterially synthesized HSA is seen in lane 4 
comigrating with the 68,000 d 14 C-labeled bovine 
serum albumin standards. Increased production of 
serum albumin in the induced versus repressed 
culture of pHSA1 represents higher levels of 
synthesis of plasmid encoded protein rather than a 
difference in 35 S-methionine pool specific 
activities for minimal versus rich media (data not 
shown). The sharp band at 60,000 d is an apparent 
artifact; this band is seen in both induced and 
repressed pHSAl and control transformants, and 
binds to preimmune (lane 7) as well as anti-HSA 
IgGs (lanes 2 — 5). The minor 47,000 d band in lane 
4 is apparently plasmid encoded and may 
represent a prematurely terminated form of bacte- 
rially synthesized HSA. 

Figure 3 depicts the nucleotide and amino acid 
sequence of human serum albumin. 

The DNA sequence of the mature protein coding 
and 3' untranslated regions of HSA mRNA were 
determined from the recombinant plasmid 
pHSAl. The DNA sequence of the prepro peptide 
coding and 5' untranslated regions were deter- 
mined from the plasmid P-14 (see text). Predicted 
amino acids are included above the ON A sequence 
and are numbered from the first residue of the 
mature protein. The preceding 24 amino acids 
comprise the prepro peptide. The five amino acid 
residues which disagree with the protein sequence 
of HSA reported by both Dayhoff (9) and Moulon et 
al. (12) are underlined. The above nucleotide 
sequence probably does not extend to the true 5' 
terminus of HSA mRNA. In the albumin direct 
expression plasmid pHSAl, the mature protein 
coding region is immediately preceded by the £. 
colt trp promoter-operator-leader peptide ribo- 
some binding site (36, 37), an artificial Xbal site r 
and an artificial initiation codon ATG; the prepro 
region has been excised. The nucleotides preced- 
ing HSA codon No. 1 in pHSAl read 

5'-TCACGTAAAAAG G GTATCTAG ATG . 

Detailed description 

(A) Synthesis and cloning of cDNA. 

Poly(A)+ RNA was prepared from quickly frozen 
human liver samples obtained by biopsy or from 
cadaver donors by either ribonucleoside-vanadyl 
complex (17) or guanidinium thiocyanate (18) 
procedures. cDNA reactions were performed 
essentially as described in (19) employing as 



primers either oligo-deoxynucleotides prepared 
by the phosphotriester method (20) or oligo 
(dT) 12 - 18 (Collaborative Research). For typical 
cDNA reactions 25 — 35 ug of poly(A)+ RNA and 

s 40 — 80 pmol of oligonucleotide primer were 
heated at 90° for 5 minutes in 50 mM NaCI. The 
reaction mixture was brought to final concen- 
trations of 20 m M Tris HCI pH 8.3, 20 mM KCI, 8 m M 
MgCl 2 , 30 mM dithiothreitol, 1 mM dATP, dCTP, 

10 dGTP, dTTP (plus 32 P-dCTP (Amersham) to follow 
recovery of product) and allowed to anneal at 42°C 
for 5'. 1 00 units of AMV reverse transcriptase (BRL) 
were added and incubation continued at 42° for 45 
minutes. Second strand DNA synthesis, SI treat- 

15 ment, size selection on polyacrylamide gels, deoxy 
(C) tailing and annealing to pBR322 which was 
cleaved with Pstl and deoxy (G) tailed, were 
performed as previously described (21, 22). The 
annealed mixture was used to transform E coti K- 

20 12 strain 294 (23) by a published procedure (24). 

(B) Screening of recombinant plasmids with 32 P- 
labelled Probes. 

E. coli transformants were grown on LB-agar 

25 plates containing 5 uxj/ml tetracycline, transferred 
to nitrocellulose filter paper (Schleicher and 
Schuell, BA85) and tested by hybridization using a 
modification of the in situ colony screening proce- 
dure (25). 32 P-end labelled (26) oligodeoxynuc- 

30 leotide fragments of from 12 to 16 nucleotides in 
length were used as direct hybridization probes, or 
32 P-cDNA probes were synthesized from RNA 
using oligo (dT) or oligodeoxynucieotide primers 
(19). Filters were hybridized overnight in 5X 

35 Denhardt's solution (27), SxSSC, (1xSSC=1.5M 
NaCI, 0.15 M Na Citrate) 50 mM Na phosphate pH 
6.8, 20 (ig/ml salmon sperm DNA at a temperatures 
ranging from 4° to 42° and washed in salt concen- 
trations varying from 1 to 0.2xSSC plus 0.1 

40 percent SDS at temperatures ranging from 4° to 
42° depending on the length of the 32 P-labelled 
probe (28). Dried filters were exposed to KodakXR- 
2 X-ray film using DuPont Lightning-Plus intensify- 
ing screens at -80°. 

45 

(C) DNA preparation and restriction enzyme analy- 
sis. 

Plasmid DNA was prepared in either large scale 
(29) or small scale ("miniprep"; 30) quantities and 
so cleaved by restriction endonucieases (New Eng- 
land Biolabs, BRL) following manufacturers con- 
ditions. Slab gel electrophoresis conditions and 
electroelution of DNA fragments from gels have 
been described (31). 

55 

(D) DNA sequencing. 

DNA sequencing was accomplished by both the 
method of Maxam and Gilbert (26) utilizing end- 
labelled DNA fragments and by dideoxy chain 
60 termination (32) on single stranded DNA from 
phage M13 mP7 subclones (33) utilizing synthetic 
oligonucleotide (20) primers. Each region was 
independently sequenced several times. 

65 
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(E) Construction of 5' end of ablumln gene for 
direct expression of HSA. 

10 \\g (-16 pmol) of the -1200 bp Pstl insert of 
plasmid F-47 was boiled in H 2 0 for 5 minutes and 
combined with 100 pmol of 32 P-end labelled 5' s 
primer ( d ATG G ATG CACACAAG ) . The mixture 
was quenched on ice and brought to a final 
volume of 120 ul of 6 mM Tris HCI pH 7.5, 6 mM 
MgCi 2 , 60 mM NaCI, 0.5 mM dATP, dCTP, dGTP, 
dTTP at 0°. 10 units of DNA polymerase I Klenow 10 
fragment (Boehringer-Mannheim) were added 
and the mixture incubated at 24° for 5 hr. Follow- 
ing phenol/chloroform extraction, the product 
was digested with Hpall, electrophoresed in a 5 
percent polyacrylamide gel, and the desired 450 15 
bp fragment eiectroeluted. the single stranded 
overhang produced by Xbal digestion of the 
vector plasmid pLelF A25 (21) was filled in to 
produce blunt DNA ends by adding deoxynuc- 
leoside triphosphates to 10 uM and 10 units DNA 20 
polymerase I Klenow fragment to the restriction 
endonuclease reaction mix and incubating at 12° 
for 10 minutes. Restriction endonuclease frag- 
ments (0.1 — 1 ug in approximate molar equality) 
were annealed and ligated overnight at 12° in 20 25 
ul of 50 mM Tris HCI pH 7.6, 10 mM MgCI 2 , 0.1 
mM EDTA, 5 mM dithiothreitol, 1 mM rATP with 
50 units T4 ligase (N.E. Biolabs). Further details of 
piasmid construction are discussed below. 

30 

(F) Protein analysis. 

Two ml cultures of recombinant E, coli strains 
were grown in either LB or M9 media plus 5 ug/ml 
tetracycline to densities of A 550 =1.0, pelleted, 
washed, repeileted, and suspended in 2 ml of LB 35 
or supplemented M9 (M9+0.2 percent glucose, 1 
ug/ml thiamine, 20 ug/ml standard amino acids 
except methionine was 2 ug/ml and tryptophan 
was excluded). Each growth medium also con- 
tained 5 ug/ml tetracycline and 100 uCi 3S S- 40 
methionine (NEN; 1200 Ci/mmol). After 1 hr incu- 
bation at 37°, bacteria was pelleted, freeze-thawed 
and resuspended in 200 ul 50 mM Tris HCI pH 7.5, 
0.12 mM NaEDTA then placed on ice for 10 
minutes following subsequent additions of 45 
lysozyme to 1 mg/ml, NP40 0.2 percent, and NaCI 
to 0.35 M. The lysate was adjusted to 10 mM 
MgCl 2 and incubated with 50 ug/ml DNase I 
(Worthington) on ice for 30 min. Insoluble 
material was removed by mild centrifugation. so 
Samples were immunoprecipitated with rabbit 
anti-HSA (Cappel Labs) and staphylococcal absor- 
bent (Pansorbtn; Cal Biochem) as described (34), 
and subjected to SDS polyacrylamide gel elec- 
trophoresis (35). 55 

(G> cDNA cloning. 

Initial cDNA clones primed with aligo (dT) were 
screened by colony hybridization with both total 
liver cDNA (to identify abundant RNA species eo 
containing clones) and with two 32 P-labelled 
cDNAs primed from liver mRNA by two sets of 
four 11 base oiigodeoxynucleotides synthesized 
to represent the possible coding variations for 
amino acids 546—549 and 294—297 of HSA. 65 



Positive colonies never contained more than 
about the 3' 1/2 of the protein coding region of the 
expected HSA mRNA sequence. (The longest of 
these recombinants was designated B-44). Since 
existing procedures were unable to directly copy 
an mRNA of the expected size (—2000 bp), syn- 
thetic oiigodeoxynucleotides were prepared to 
correspond to the antimessage strand at regions 
near the 5' extreme of B-44. From the nucleotide 
sequence of B-44, we constructed a 12 base 
oligodeoxynucieotide corresponding to amino 
acids 369—373. This was used to prime cDNA 
synthesis of liver mRNA and produce cDNA clo- 
nes in pBR322 containing the 5' portion of the 
HSA message while overlapping the existing B-44 
recombinant. Approximately 400 resulting clones 
were screened by colony hybridization with a 16 
base oligodeoxynucieotide fragment located 
slightly upstream in the mRNA sequence we had 
thus far determined. Approximately 40 percent of 
the colonies hybridized to both probes. Many of 
those colonies which failed to contain hybridizing 
plasmids presumably resulted from RNA self- 
priming or priming with contaminating oligo (dT) 
during reverse transcription, or lost the 3' region 
containing the sequence used for screening. 
"Miniprep" amounts of plasmid DNA from hyb- 
ridizing colonies were digested with Pstl . Three 
recombinant plasmids contained sufficiently 
large inserts to code for the remaining 5' portion 
of the HSA message. Two of these (F-15 and F-47) 
contained the extreme 5' coding portion of the 
gene but failed to extend back to a Pstl site 
necessary for joining with B-44 to reform the 
complete albumin gene. Recombinant F-61 
possessed this site but lacked the 5' extreme end. 
A three part reconstruction of the entire message 
sequence was possible employing restriction 
endonuclease sites in common with the part 
length clones F-47, F-61 and B-44 (Fig. 1). 

An additional cDNA clone extending further 5' 
was obtained by similar oligodeoxynucieotide 
primed cDNA synthesis (from a primer corre- 
sponding to amino acid codon No. 175 — 179). 
Although not employed in the construction of the 
mature HSA expression plasmid, this cDNA clone 
(P-14) allowed determination of the DNA 
sequence of the "prepro" peptide coding and 5' 
non-coding regions of the HSA mRNA. 

The mature HSA mRNA sequence was joined to 
a vector plasmid for direct expression of the 
mature protein in £ coli via the trp promoter- 
operator. The plasmid pLelF A25 directs the 
expression of human leukocyte interferon A 
(IFNa2) (21). It was digested with Xbal and the 
cleavage site "filled in" to produce blunt DNA 
ends with DNA polymerase I Klenow fragment 
and deoxynucleoside triphosphates. After subse- 
quent digestion with Pstl. a "vector" fragment 
was gel purified that contained pBR322 
sequences and a 300 bp fragment of the E coli trp 
promoter, operator, and ribosome binding site of 
the trp leader peptide terminating in the artifi- 
cially blunt ended Xbal cleavage site. A 15 base 
oligodeoxynucieotide was designed to contain 
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the initiation codon ATG followed by the 12 
nucleotides coding for the first four amino acids 
of HSA as determined by DNA sequence analysis 
of clone F-47. In a process referred to as "primer 
repair", the gene-containing Pstl fragment F-47 s 
was denatured, annealed with excess 15-mer and 
reacted with DNA polymerase I Klenow fragment 
and deoxynucieoside triphosphates. This reaction 
extends a new second strand downstream from 
the annealed oligonucleotide, degrades the single m 
stranded DNA upstream of codon number one 
and then polymerizes upstream three nucleotides 
complementary to ATG. In addition, when this 
product is blunt-end ligated to the prepared vec- 
tor fragment, its initial adenosine residue 75 
recreates an Xbai restriction site. Following the 
primer repair reaction, the DNA was digested with 
Hpall and a 450 bp fragment containing the 5' 
portion of the mature albumin gene was gel 
purified (see Fig. 1). This fragment was annealed 20 
and ligated to the vector fragment and to the gel 
isolated Hpall to Psti portion of F-47 and used to 
transform £. coii cells. Diagnostic restriction 
endonuclease digests of plasmid minipreps iden- 
tified the recombinant A-26 which contained the 25 
5' portion of the mature albumin coding region 
ligated properly to the trp promoter-operator. For 
the final steps in assembly, the A-26 plasmid was 
digested with BgPII plus Pstl and the —4 kb 
fragment was gel purified. This was annealed and 30 
ligated to a 390 bp Pstl, Bglll partial digestion 
fragment purified from F-61 and a 1000 bp Pstl 
fragment of B-44. Restriction endonuclease analy- 
sis of resulting transformants identified plasmids 
containing the entire HSA coding sequence 35 
properly aligned for direct expression of the 
mature protein. One such recombinant plasmid 
was designated pHSA1. When E coii containing 
pHSA1 is grown in minimal media lacking tryp- 
tophan, the ceils produce a protein which speci- 40 
fically reacts with HSA antibodies and comigrates 
with HSA in SDS polyacrylamide electrophoresis 
(Fig. 2). No such protein is produced by identical 
recombinants grown in rich broth, implying that 
production in £, coii of the putative HSA protein is 45 
under control of the trp promoter-operator as 
designed. To insure the integrity of the HSA 
structural gene in the recombinant plasmid, 
pHSA1 was subject to DNA sequence analysis. 

50 

(H) DNA sequence analysis 

The albumin cDNA portion (and surrounding 
regions) of pHSA1 were sequenced to completion 
by both the chemical degradation method of 
Maxam and Giblert (26) and the dideoxy chain 55 
termination procedure employing templates 
derived from single stranded M13 mP7 phage 
derivatives (32, 33). All nucleotides were 
sequenced at least twice. The DNA sequence is 
shown in Fig. 3 along with the predicted amino 60 
acid sequence of the HSA protein. The DNA 
sequence farther 5' to the mature HSA coding 
region was also determined from the cDNA clone 
P-14 and is included in Fig. 3. 

DNA sequence analysis confirmed that the arti- 65 



ficial initiation codon and the complete mature 
HSA coding sequence directly follows the E. coii 
trp promoter-operator as desired. The ATG 
initiator follows the putative £ coii ribosome 
binding sequence (36) of the trp leader peptide 
(37) by 9 nucleotides. 

Translation of the DNA sequence of pHSA1 
predicts a mature HSA protein of 585 amino acids. 
Various published protein sequences of HSA dis- 
agree at about 20 amino acids. The present 
sequence differs by eleven residues from Moulon 
etai. (12), and by 28 residues from that reported 
in the Dayhoff catalogue (9) credited as arising 
primarily from Behrens et al. (10) with contribu- 
tions by Moulon et ai. (12). Most of these 
differences represent inversions of pairs of adja- 
cent residues or glutamine-glutamic acid disag- 
reements. Only at five of the 585 residues does 
our sequence differ from the residue reported by 
both Dayhoff (9) and Moulon etai. (12), and three 
of these five differences represent glutamine- 
glutamic acid interchanges (underlined in Figure 
3). At all discrepant positions the nucleotide 
sequencing has been carefully rechecked and it is 
unlikely that DNA sequencing errors are the cause 
of these reported differences. The possibility of 
artifacts introduced by cDNA cloning cannot be 
ruled out. However, other likely explanations exist 
for the amino acid sequence differences among 
various reports. These include changes in amida- 
tion (affecting glutamine-glutamic acid discrimi- 
nation) occurring either in vivo or during protein 
sequencing (38). Polymorphism in HSA proteins 
may also account for some differences; over 
twenty genetic variants of HSA have been 
detected by protein electrophoresis (39) but have 
not yet been analyzed at the amino acid sequence 
level. It is also worth noting that our predicted 
HSA protein sequence is 585 amino acids long, in 
agreement with Moulon (12) but not Dayhoff (9). 
The difference is accounted for by the deletion (in 
ref. 9) of one phenylalanine (Phe) residue in a Phe- 
Phe pair at amino acids 156 — 157. 

When compared to the DNA sequence of a rat 
serum albumin cDNA CLONE (16) the present 
mature HSA sequence shares 74 percent homol- 
ogy at the nucleotide and 73 percent homology at 
the amino acid level. (The rat SA protein is one 
amino acid shorter than HSA; the carboxy ter- 
minal residue of HSA is absent in the rat protein.) 
All 35 cysteine residues are located in identical 
positions in both proteins. The predicted "pre- 
pro" peptide region of HSA shares 76 percent 
nucleotide and 75 percent amino acid homology 
with that reported from the rat cDNA clone (16). 
Interspecies sequence homology is reduced in the 
portion of the 3' untranslated region which can be 
compared (the published rat cDNA clone ends 
before the 3' mRNA terminus). The HSA cDNA 
contains the hexanucleotide AATAAA 28 nuc- 
leotides before the site of poly (A) addition. This is 
a common feature of eukaryotic mRNAs first 
noted by Proudfoot and Brownlee (40). 
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Pharmaceutical compositions 

The compounds of the present invention can be 
formulated according to known methods to pre- 
pare pharmaceutically useful compositions, 
whereby the polypeptide hereof is combined in 
admixture with a pharmaceutically acceptable 
carrier vehicle. Suitable vehicles and their formu- 
lation are described in Remington's Pharmaceuti- 
cal Sciences by E. W. Martin, which is hereby 
incorporated by reference. Such compositions 
will contain an effective amount of protein hereof 
together with a suitable amount of vehicle in 
order to prepare pharmaceutically acceptable 
compositions suitable for effective administration 
to the host. One preferred mode of administration 
is parenteral. 
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Claims for the Contracting States: BE, CH, DE, FR, 
GB, IT, LI, LU, NL SE 

1. A DNA isolate comprising a continuous 
sequence encoding human serum albumin of the 
amino acid sequence depicted in Fig. 3 hereof and 
genetic variants thereof. 

2. An expression vehicle comprising a DNA 
coding sequence according to claim 1 operably 
linked with a DNA vector capable of effecting the 
microbial expression of said sequence so as to 
prepare the corresponding human serum albu- 
min. 

3. A microorganism transformed with the 
vehicle according to claim 2. 

4. The microorganism according to claim 3, 
obtained by transforming an £ coli bacterial or a 
yeast strain. 

5. A process which comprises microbiaily 
expressing human serum albumin of the amino 
acid sequence depicted in Fig. 3 hereof and 
genetic variants thereof. 

6. A process which comprises expressing 
human serum albumin in a transformed microor- 
ganism of claim 3 or claim 4. 

7. A process of claim 5 or claim 6 which further 
includes using the human serum albumin to 
prepare a pharmaceutical composition for the 
therapeutic treatment of humans. 

Claims for the Contracting State: AT 

1. A method which comprises the preparation 
of a DNA isolate comprising a continuous 
sequence encoding human serum albumin of the 
amino acid sequence depicted in Fig. 3 hereof and 
genetic variants thereof. 

2. A method which comprises the preparation 
of an expression vehicle comprising a DNA cod- 
ing sequence according to claim 1 operably linked 
with a DNA vector capable of effecting the 
microbial expression of said sequence so as to 
prepare the corresponding human serum albu- 
min. 

3. A microorganism transformed with the 
expression vehicle as defined in claim 2. 

4. The microorganism according to claim 3, 
obtained by transforming an £. coli bacterial or a 
yeast strain. 

5. A process which comprises microbiaily 
expressing human serum albumin of the amino 
acid sequence depicted in Fig. 3 hereof and 
genetic variants thereof. 

6. A process which comprises expressing 
human serum albumin in a transformed microor- 
ganism of claim 3 or claim 4. 

7. A process of claim 4 or claim 6 which further 
includes using the human serum albumin to 
prepare a pharmaceutical composition for the 
therapeutic treatment of humans. 



Patentanspruche fur die Vertragsstaaten: BE CH 
DE FR GB IT LI LU NL SE 

1. DNA-lsolat umfassend eine kontinuierliche 
5 Sequenz kodierend fur menschliches Serum-Alb- 
umin der in Fig. 3 der vorliegenden Anmeldung 
dargestellten Aminosauresequenz und geneti- 
sche Varianten desselben. 

2. Expressionsvehikel umfassend eine DNA- 
70 Kodierungsequenz nach Anspruch 1 , die betriebs- 

fahig an einen DNA-Vektor gebunden ist, der 
fahig ist, die mikrobielle Expression der ge- 
nannten Sequenz zu bewirken, um das ent- 
sprechende menschliche Serum-Albumin her- 
15 zustellen. 

3. Mikroorganismus, der mit den Vehikei nach 
Anspruch 2 transformiert ist. 

4. Mikroorganismus nach Anspruch 3, der 
durch Transformieren eines E. coli Bakterien- 

20 oder eines Hefestammes erhalten wird. 

5. Verfahren umfassend die mikrobielle Expres- 
sion von menschlichem Serum-Albumin mit der 
in Fig. 3 der vorliegenden Anmeldung dargestell- 
ten Aminosauresequenz und genetische Varian- 
ts ten desselben. 

6. Verfahren umfassend die Expression von 
menschlichem Serum-Albumin in einem transfor- 
mierten Mikroorganismus nach Anspruch 3 oder 
4. 

30 7. Verfahren nach Anspruch 5 oder 6, das 
weiters die Verwendung des menschlichen 
Serum-Albumins zur Herstellung einer pharma- 
zeutischen Zusammensetzung fur die therapeuti- 
sche Behandlung des Menschen umfafit. 

35 

Patentanspruche fur den Vertagsstaat: AT 

1. Verfahren umfassend die Herstellung eines 
DNA-IsolateSr das eine kontinuierliche Sequenz 

40 kodierend fur menschliches Serum-Albumin der 
in Fig. 3 der vorliegenden Anmeldung dargestell- 
ten Aminosauresequenz und genetische Varian- 
ten desselben umfaBt. 

2. Verfahren umfassend die Herstellung eines 
45 Expressionsvehikels, das eine fur DNA kodie- 

rende Sequenz nach Anspruch 1 umfafct, die 
betriebsfahig mit einem DNA-Vektor verbunden 
ist, der fahig ist, die mikrobielle Expression der 
genannten Sequenz zu bewirken, um das entspre- 
so chende menschliche Serum-Albumin herzustel- 
len. 

3. Mikroorganismus, der mit dem in Anspruch 2 
definierten Expressionsvehikel transformiert ist. 

4. Mikroorganismus nach Anspruch 3, der 
55 durch Transformieren eines E. coli-Bakterien- 

oder eines Hefestammes erhalten wird. 

5. Verfahren umfassend die mikrobielle Expres- 
sion von menschlichem Serum-Albumin der in 
Fig. 3 der vorliegenden Anmeldung dargestellten 

60 Aminosauresequenz und genetischer Varianten 
desselben. 

6. Verfahren umfassend die Expression von 
menschlichem Serum-Albumin in einem transfor- 
mierten Mikroorganismus nach Anspruch 3 oder 

65 4. 
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7. Verfahren nach Anspruch 5 oder 6, weiters 
umfassend die Verwendung des menschlichen 
Serum-Albumins zur Hersteilung einer pharma- 
zeutischen Zusammensetzung fur die therapeuti- 
sche Behandlung des Menschen. 

Revendications pour les Etats contractants: BE 
CH DE FR GB IT LI LU NL SE 

1. Produit isole d'ADN comprenant une 
sequence continue codant pour la serum-albu- 
mine humaine de la sequence d'acides amines 
presentee a ia figure 3 et ses variants genetiques. 

2. Vehicule d'expression comprenant une 
sequence d'ADN de codage selon la revendica- 
tion 1 Ii6 de fagon operatoire avec un vecteur 
d'ADN capable d'accomplir I'expression micro- 
biologique de ladite sequence de fagon a preparer 
la se>um-albumine humaine correspondante. 

3. Micro-organisme transforme avec la vehicule 
selon la revendication 2. 

. 4. Micro-organisme selon la revendication 3, 
obtenu par transformation d'une souche bact6- 
rienne de E coli ou d'une souche de levure. 

5. Procede qui comprend I'expression micro- 
biologique de la serum-albumine humaine de la 
sequence d'acides amines presentee a la figure 3 
et de ses variants genetiques. 

6. Procede qui comprend I'expression de la 
serum-albumine humaine dans un micro-orga- 
nisme transforme selon la revendication 3 ou la 
revendication 4. 

7. ProcedS selon la revendication 5 ou la reven- 
dication 6, qui comprend en outre I'utilisation de 
la serum-albumine humaine pour preparer une 



composition pharmaceutique pour le traitement 
therapeutique des humains. 

Revendications pour I'Etat contractant: AT 

5 

1. Proc6d6 qui comprend ia preparation d'un 
produit isole d'ADN comprenant une sequence 
continue codant pour la serum-albumine 
humaine de la sequence d'acides amines presen- 

w tee a la figure 3 et ses variants genetiques. 

2. Procede qui comprend la preparation d'un 
vehicule d'expression comprenant une sequence 
d'ADN de codage selon ia revendication 1 lie de 
fagon operatoire a un vecteur d'ADN capable 

is d'effectuer I'expression microbiologique de ladite 
sequence de fagon a preparer ia serum-albumine 
correspondante. 

3. Micro-organisme transforme avec le vehicule 
d'expression tel que defini a la revendication 2. 

20 4. Micro-organism selon ia revendication 3, 

obtenu par transformation d'une souche bacte- 
rienne de E coli ou d'une souche de levure. 

5. Procede qui comprend I'expression micro- 
biologique de la serum-albumine humaine de la 

25 sequence d'acides amines representee a la figure 
3 et de ses variants g£nerique. 

6. Procede qui comprend ('expression de la 
serum-albumine humaine dans un micro-orga- 
nisme transform^ selon la revendication 3 ou la 

30 revendication 4. 

7. ProcedS selon la revendication 5 ou ia reven- 
dication 6 qui comprend en outre i'utilisation de la 
serum-albumine humaine pour preparer une 
composition pharmaceutique pour le traitement 

35 therapeutique des humains. 
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ACGATGTC7TC7GGCAATnCATATAASTATTTTTTCAAAAATGTCTCTTCT6TCAACCCCACGCCTTTGCC 

(Prtpro) 

Htt Lys Trp Val Thr Pht II • Ser Leu Leu Phe Itu Phe Ser Ser Ala Tyr Str Arg Gly Val Pht Arg Arg 

ACA ATG AAG TGG GTA ACC TTT ATT TCC CTT CTT TTT CTC TTT ACC TCfi GCT TAT TCC AGS GGT GTG TTT CGT CGA 

1 (Mature] 

Asp Ale H1$ Lys Ser Glu Val Alt Ms Arg Pht Lys Asp Leu Gly Glu Glu Asn Pht Lys Ala Ltu Ytl Itu lit 
GAT GCA CAC AM AGT SAG GTT GCT CAT CGG TTT AAA GAT TTG GGA 6AA GAA AAT TTC AAA GCC TTG GTG TTG ATT 

50 

Alt Pht Alt Gin Tyr Leu Gin Gin Cys Pro Pht Glu Asp Mis Vtl Lys Ltu Vtl Asn Glu Val Thr Glu Pht Ala 
GCC TTT GCT CAG TAT CTT CAG CAG TGT CCA TTT GAA GAT CAT GTA AAA TTA GTG AAT GAA GTA ACT GAA TTT GCA 

Lys Thr Cys Vtl Alt Asp Glu Str AU Glu Asn Cys Asp Lys Str Ltu His Thr Leu Pht S1y Asp Lys Leu Cys 
AAA ACA TGT GTA GCT GAT GAG TCA GCT GAA AAT T6T GAC AAA TCA CTT CAT ACC CTT TTT GGA GAC AAA TTA TGC 

100 

Thr Yi1 Alt Thr Leu Arg Glu Thr Tyr Gly Glu Htt Ala Asp Cys Cys All Lys Gin Glu Pro Glu An Asn Glu 
ACA GTT GCA ACT CTT CGT GAA ACC TAT GGT GAA ATG GCT GAC TGC TGT GCA AAA CAA GAA CCT GAG AGA AAT GAA 

Cys Pht Ltu Gin His Lys Asp Asp Asn Pro Asn Ltu Pro Arg Ltu Vtl Arg Pro Glu Vtl Asp Vtl Htt Cys Thr 
TGC TTC TTG CAA CAC AAA GAT GAC AAC CCA AAC CTC CCC CGA TTG GTG AGA CCA GAG GTT GAT GTG ATG TGC ACT 

150 

Alt Phe Hfs Asp Asn Glu Glu Thr Pht Leu Lys Lys Tyr Ltu Tyr Glu lie Ala Arg Arg His Pro Tyr Pht Tyr 
GCT TTT CAT GAC AAT GAA GAG ACA TTT TTG AAA AAA TAC TTA TAT GAA ATT GCC AGA AGA CAT CCT TAC TTT TAT 

Alt Pro Glu Ltu Ltu Pht Pht Alt Lys Arg T vr Lys Ala Ala Pht Thr Glu Cys Cys Gin Ala Ala Asp Lys Ala 
GCC CCG GAA CTC CTT TTC TTT GCT AAA AGG TAT AAA GCT GCT TTT ACA GAA TGT TGC CAA 6CT GCT GAT AAA GCT 

200 

Ala Cys Ltu Ltu Pro Lys Ltu Asp Glu Ltu Arg Asp Glu Gly Lys Ala Str Str Alt Lys Gin Arg Ltu Lys Cys 
GCC TGC CTG TTG CCA AAG CTC GAT GAA CTT CGG GAT GAA GGG AAG GCT TCG TCT GCC AAA CAG AGA CTC AAA TGT 

Ala Str Leu Gin Lys Pht Gly Glu Arg Alt Pht Lys Ala Trp Ala Val Ala Arg Ltu Str Gin Arg Phe Pro Lys 
GCC AGT CTC CAA AAA TTT GGA GAA AGA GCT TTC AAA GCA TGG GCA GTG GCT CGC CTG AGC CAG AGA TTT CCC AAA 

250 

Alt Glu Pht Ala Glu Val Str Lys Ltu Val Thr Asp Leu Thr Lys Val His Thr Glu Cys Cys His Gly Asp Ltu 
GCT GAG TTT GCA GAA GTT TCC AAG TTA GTG ACA GAT CTT ACC AAA GTC CAC ACS GAA TGC TGC CAT GGA GAT CTG 

Leu Glu Cys Ala Asp Asp Arg Alt Asp Ltu Alt Lys Tyr lit Cys Glu Asn Gin Asp Str lit Str Str Lys Ltu 
CTT GAA TCT GCT GAT GAC AGC GCG GAC CTT GCC AAG TAT ATC TGT GAA AAT CAG GAT TCG ATC TCC AGT AAA CTG 

300 

Lys Glu Cys Cys Glu Lys Pro Ltu Ltu Glu Lys Str His Cys lit Ala Glu val Glu Asn Asp Glu Met Pro Ala 
AAG GAA TGC TGT GAA AAA CCT CTG TTG GAA AAA TCC CAC TGC ATT GCC GAA GTG GAA AAT GAT GAG ATG CCT GCT 

Asp Leu Pro Str Ltu Ala Ala Asp Pht Val Glu Str Lys Asp Val Cys Lys Asn Tyr Ala Glu Ala Lys Asp Vtl 
GAC TTG CCT TCA TTA GCT GCT GAT TTT GTT GAA AGT AAG GAT GTT TGC AAA AAC TAT GCT GAG GCA AAG GAT GTC 

350 

Pht Leu Gly Htt Pht Ltu Tyr Glu Tyr Alt Arg Arg His Pro Asp Tyr Str Val Vtl Ltu Ltu Ltu Arg Ltu Ala 

TTC CTG GGC ATG TTT TTG TAT GAA TAT GCA AGA AGG CAT CCT GAT TAC TCT GTC GTG CTG CTG CTG AGA CTT GCC 

Lys Thr Tyr Glu Thr Thr Ltu Glu Lys Cys Cys Ala Alt Ala Asp Pro His Glu Cys Tyr Alt Lys Vtl Pht Asp 
AAG ACA TAT GAA ACC ACT CTA GAG AAG TGC TGT GCC GCT GCA GAT CCT CAT GAA TGC TAT GCC AAA GTG TTC GAT 

400 

Glu Pht Lys Pro Leu Val Glu Glu Pro G1n Asn Ltu lit Lys Gin Asn Cys Glu Ltu Pht Lys Gin Ltu Gly Glu 
GAA TTT AAA CCT CTT GTG GAA GAG CCT CAG AAT TTA ATC AAA CAA AAC TGT GAG CTT TTT AAG CAG CTT GGA GAG 

Tyr Lys Pht Gin Asn Ala Leu Leu Val Arg Tyr Thr Lys Lys Val Pro Gin Val Str Thr Pro Thr Ltu Vtl Glu 
TAC AAA TTC CAG AAT GCG CTA TTA GTT CGT TAC ACC AAG AAA GTA CCC CAA GTG TCA ACT CCA ACT CTT GTA GAG 

450 

Vtl Str Arg Asn Leu Gly Lys Vtl Gly Str Lys Cys Cys Lys His Pro Glu Alt Lys Arg Htt Pro Cys Ala Glu 
GTC TCA AGA AAC CTA GGA AAA GTG GGC AGC AAA TGT TGT AAA CAT CCT GAA GCA AAA AGA ATG CCC TGT GCA GAA 

Asp Tyr Ltu Str Vtl Vtl Ltu Asn Gin Ltu Cys Vtl Leu His Glu Lys Thr Pro Vtl Str Asp Arg Val Thr Lys . 
GAC TAT CTA TCC GTG GTC CTG AAC CAG TTA TGT GTG TTG CAT GAG AAA ACQ CCA GTA AGT GAC AGA GTC ACA AAA 

500 

Cys Cys Thr Glu Str Ltu Vtl Asn Arg Arg Pro Cys Pht Str Alt Ltu G1u Val Asp Glu Thr Tyr Val Pro Lys 
TGC TGC ACA SAG TCC TTG GTG AAC AGG CGA CCA TGC TTT TCA GCT CTG GAA GTC GAT GAA ACA TAC GTT CCC AAA 

Glu Pht Asn Ala Glu Thr Pht Thr Pht His Ala Asp lit Cys Thr Ltu Str Glu Lys Glu Arg 61 n lit Lys Lys 
GAG TTT AAT GCT GAA ACA TTC ACC TTC CAT GCA GAT ATA TGC ACA CTT TCT GAG AAG GAG AGA CAA ATC AAG AAA 

550 

Gin Thr Ala Ltu Val Glu Leu Val Lys His Lys Pro Lys Ala Thr Lys Glu Gin Ltu Lys Ala Vtl Net Asp Asp 
CAA ACT GCA CTT GTT GAG CTT GTG AAA CAC AAG CCC AAG GCA ACA AAA GAG CAA CTG AAA GCT GTT ATG GAT GAT 

Pht Alt Ala Pht Val Glu Lys Cys Cys Lys Ala Asp Asp Lys Glu Thr Cys Pht Ala 6Tu Glu Gly Lys Lys Leu 
TTC GCA GCT TTT GTA GAG AAG TGC TGC AAG GCT GAC GAT AAG GAG ACC TGC TTT GCC GAG GAG GGT AAA AAA CTT 

Val Ala Ala Ser Gin Ala Ala Ltu Gly Ltu End 

GTT GCT GCA AGT CAA GCT GCC TTA GGC TTA TAA C ATC TAC ATTT AAAAGCATC TCAGCCTACC ATGAGAATAAGAGAAAGAAAATGAA 
GATCAAAACCTTATTCATCTGTTTTCTTTTTCGTTte 

GTGCTTCAATTAATAAAAMTGGAAAGAATCTAATAGAGT6GTACAGCACTGTTATTTTTCAA 

TGfiAAGTTCCAGTCTTCTCTCTTATTCCACTTCGGTAGAG Pol y ( A ) 
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